Abstract

We develop a decision tree based game-theoretical approach for constructing functional responses in multi-prey/multi- 
patch environments and for finding the corresponding optimal foraging strategies. Decision trees provide a way to describe 
details of predator foraging behavior, based on the predator's sequence of choices at different decision points, that 
facilitates writing down the corresponding functional response. It is shown that the optimal foraging behavior that 
maximizes predator energy intake per unit time is a Nash equilibrium of the underlying optimal foraging game. We apply 
these game-theoretical methods to three scenarios: the classical diet choice model with two types of prey and sequential 
prey encounters, the diet choice model with simultaneous prey encounters, and a model in which the predator requires a 
positive recognition time to identify the type of prey encountered. For both diet choice models, it is shown that every Nash 
equilibrium yields optimal foraging behavior. Although suboptimal Nash equilibrium outcomes may exist when prey 
recognition time is included, only optimal foraging behavior is stable under evolutionary learning processes. 
Introduction

The functional response [1,2] considers the number of prey (or 
resource items) consumed by a single predator (or forager) as 
influenced by prey abundance. By dictating the mortality rate of 
prey and the feeding rate of predators, it is central to 
understanding consumer-resource dynamics [3,4]. Furthermore, 
the functional response can be extended to consider a predator 
seeking two prey types [1]. Besides being more realistic for many 
predators, functional responses on two food types create indirect 
effects between the prey via the shared predator. For instance, if 
consuming a prey item takes time or reduces motivation, then the 
presence of a second food type decreases the forager's consump- 
tion of the first food type. Via the functional response, such prey 
become indirect mutualists [5]. Conversely, short-term apparent 
competition [6,7] results if the presence of the second prey 
encourages the predator to spend more time or effort searching for 
and capturing prey. This happens when foragers bias their efforts 
towards areas rich in resources. Regardless, the two-food 
functional response is central to understanding diets, optimal 
foraging for multiple resources, predator mediated indirect effects 
between prey, and population dynamics within food webs. 

Two modeling approaches have addressed the question of diet 
choice for a forager that searches for and then handles 
encountered prey items. The first is found in classic optimal 
foraging models. The forager's encounter probability or attack rate 
[3] is viewed as a mass action phenomenon between the predator 
and its prey. The forager's overall encounter rate with prey is
simply the product of prey abundance and the predator's
encounter probability on that prey. Upon encountering a prey,
the forager can elect to consume the prey at some handling time
cost, or reject the opportunity and continue the search for other
prey. Starting from Holling's [1] two-food functional response this
approach has generated increasingly sophisticated predictions.

In Pulliam [8] (see also [9]), a "zero-one" or "bang-bang" rule
for diet choice was derived. A forager should either always accept 
or always reject an encountered food item. When encountered, the 
preferred food (based on a higher reward to handling time ratio) 
should always be consumed. If searching for and handling the 
preferred food type yields more (or less) reward than simply 
handling the less preferred food, then the less preferred food 
should always be rejected (or accepted) when encountered. 
Empirical support was encouraging but equivocal [10]. Most
foragers show partial selectivity: they are neither completely
opportunistic nor completely selective. A number of mechanisms
have been proposed and modeled for why foragers sometimes only
partially consume a less preferred food, including food depletion
[11], food bulk and digestion limitations [12], complementary
nutrients [13], local omniscience [14], incorrect prey classification
and sampling by predators [15,16], and prey crypsis [17].

A second approach to diet choice is emerging from spatially- 
explicit models such as agent based models. A forager may move 
through a lattice or some form of continuous space. Prey items 
may occur at fixed locations or may also move through the defined
space. The forager possesses some detection radius. Upon
detecting a prey, the forager can choose to ignore the prey or
attempt a capture. Such approaches lead to greater realism by
considering the roles of space and individual contingencies. While
they move through the same landscape, each individual forager
becomes more or less unique based on its own personal history of
movement, food encounters, and foraging decisions. Some
individuals may experience unusually high or low harvest rates
as a consequence of runs of good or bad luck, respectively. Like the
classical models of diet choice, the foragers can still make optimal
foraging decisions by deciding which encountered foods to handle
or reject. The simulations can be run with a myriad of decision
rules, and the performance of these rules can be compared. While
a best diet choice rule may emerge from a particular scenario, the
explicit nature of the agent based models may obscure the
elegance or simplicity of the decision rule. Such agent based
models may approximate more or less the optimal decision rules
from the first approach to diet choice [14].

Here we develop a decision theory approach to diet choice. We 
use an explicit decision tree to evaluate the costs and benefits of 
different choices. Such a decision tree has similarities to extensive 
form games from game theory [18,19]. Our goals are threefold.
First, does an explicit consideration of decision making recover the
results from the classic "mass-action" models of diet choice?
Second, can these decision trees assist in uncovering the optimal
decision rules for agent-based foraging models? Third, what are
the similarities and differences between the decision tree of a
forager and evolutionary games in extensive form? To achieve
these goals we imagine a forager that searches for and handles 
food items of two types. 

We consider three different scenarios based on the nature of 
searching for food and the ability to recognize a food's type upon 
encounter. In the first, search is undirected in terms of food type, 
but upon encountering a food item the forager instantly recognizes 
its type. This accords with the assumptions that generate Holling's
two-food functional response and an "all or nothing" decision rule 
of food type acceptability. In the second the forager may 
encounter one prey of each type (called simultaneous encounter 
[20]), but can only handle one of the items, the other being lost. 
For instance, these two prey may be together at the same place 
competing over a common resource. Alternatively, the predator 
may search a small area completely for any prey before deciding 
whether to attack. In the third, we consider recognition time where 
the forager must expend additional time if it wants to know the 
type of food that has been encountered prior to handling. 

Methods 

Decision trees and the functional response for two prey 
types 

In this section, we develop a decision tree method to derive the
predator's functional response. The tree details the predator-prey
interactions under consideration. We envision several prey types
spatially distributed among many patches (that we will call
microhabitats). The encounter events are then partially deter- 
mined by the prey through their spatial distribution before the 
predator arrives. For instance, if prey are territorial, then the 
predator can encounter at most one solitary prey in a given 
microhabitat. At another extreme, if the different types of prey 
aggregate, then the predator can encounter different prey types at 
the same time. Thus, encounter events depend on the spatial 
behavior of the prey. 

We break the predation process into different stages. A typical 
predation process has at least three stages that answer the 
following questions: 1. What prey (or types of prey) does the
predator encounter? 2. What does the predator do in a given
encounter situation (e.g. does the predator attack a prey, what type
does it attack, etc.)? 3. Is the predator successful or not if it attacks?
Here, we construct functional responses from the underlying
decision trees based on three scenarios. This construction is,
however, quite general and described fully in section Decision
trees and the functional responses of Appendix S1. We start with a
well known example that leads to the Holling type II functional
response for two prey types.

Suppose that there are two types of prey A and B with fixed
densities $x$ and $y$, respectively. We assume that these prey are
scattered randomly among $N$ microhabitats where $N$ is much
larger than the number of individuals (i.e., $N \gg x+y$). Thus, there
will be at most one prey in each microhabitat (i.e. the probability
that there are two or more in some microhabitat is negligible).
The probability that a given microhabitat has no prey is then
$p_0 = 1 - \frac{x}{N} - \frac{y}{N}$, that it has exactly one prey A is
$p_A = \frac{x}{N}$, and that it has exactly one prey B is
$p_B = \frac{y}{N}$. These probabilities are assumed not to change with
time, which is the usual assumption when deriving a functional
response.

Suppose the predator chooses a microhabitat to search at 
random, that it always finds the prey in this microhabitat if there is 
one, and that it takes a searching time $\tau_s$ for it to determine
whether a prey is there or not. There are then three possible
encounter events: the predator encounters a prey of type A, a prey
of type B, or no prey at all. These events occur with probabilities
$p_A$, $p_B$ and $p_0$ respectively (see Figure 1, Level 1).

For the first event when encountering prey A, the predator has
two possible actions: either "attack prey A" or "do not attack
prey A." These actions occur with probabilities $q_A$ and $1-q_A$,
respectively (see Figure 1, Level 2). Similarly, in the second event
when a predator encounters prey B, the two possible actions of the
predator are to "attack prey B" and "do not attack prey B" with
probabilities $q_B$ and $1-q_B$, respectively. For the third event, when
no prey are found, the only predator action is "do not attack" with
probability 1. Altogether, there are five possible predator activities,
and these correspond to the five edges at Level 2 in the decision
tree of Figure 1.
Figure 1. The decision tree for two prey types. The first level gives
the prey encounter distribution. The second level gives the predator
activity distribution. The final row of the diagram gives the probability
of each predator activity event and so sum to 1. Since each entry here is
simply the product of the probabilities along the path leading to this
endpoint, we do not provide them in the decision trees from now on.
With random prey distribution and $N$ large, $p_0 = 1 - \frac{x}{N} - \frac{y}{N}$,
$p_A = \frac{x}{N}$ and $p_B = \frac{y}{N}$. If prey A is the more profitable type,
the edge in the decision tree corresponding to not attacking this type of
prey is never followed at optimal foraging (indicated by the dotted edge
in the tree). The reduced tree is then the resulting diagram with this
edge removed.
doi:10.1371/journal.pone.0088773.g001
Let the predator's handling times of prey A and B be $\tau_{Ah}$ and
$\tau_{Bh}$, respectively. The five predator activity events are: encounter a
microhabitat with prey A and attack it; encounter a microhabitat
with prey A and do not attack it; encounter a microhabitat with
prey B and attack it; encounter a microhabitat with prey B and do
not attack it; encounter an empty microhabitat. The probability
distribution of these activities (i.e. the "activity distribution",
[21]) in this order is $p_A q_A$, $p_A(1-q_A)$, $p_B q_B$, $p_B(1-q_B)$, $p_0$
with duration times $\tau_s+\tau_{Ah}$, $\tau_s$, $\tau_s+\tau_{Bh}$, $\tau_s$, $\tau_s$,
respectively. All this information is included in the decision tree of
Figure 1. Also included in this tree are the energy consequences ($\pi_i$)
to the predator of each of the five activities.

Calculation of functional responses is based on renewal theory
(for details, see section Decision trees and the functional responses
of Appendix S1) which proves that the long term intake rate of a
given prey type can be calculated as the mean energy intake
during one renewal cycle divided by the mean duration of the
renewal cycle [20,22-24]. A single renewal cycle is given by a
predator passing through the decision tree in Figure 1. Since type
A prey are only killed when the path denoted by $p_A$ and then $q_A$ is
followed, the functional response to prey A, $f_A(q_A,q_B)$, is given
through Figure 1 by

$$f_A(q_A,q_B) = \frac{p_A q_A}{p_0\tau_s + p_A[q_A(\tau_s+\tau_{Ah}) + (1-q_A)\tau_s] + p_B[q_B(\tau_s+\tau_{Bh}) + (1-q_B)\tau_s]} = \frac{p_A q_A}{\tau_s + p_A q_A\tau_{Ah} + p_B q_B\tau_{Bh}}.$$

Similarly, the functional response for prey B is

$$f_B(q_A,q_B) = \frac{p_B q_B}{\tau_s + p_A q_A\tau_{Ah} + p_B q_B\tau_{Bh}}.$$



These are the functional responses assumed in standard two
prey models (e.g., [9,20,25]) given in our notation. For instance, if
we normalize searching time so that $N\tau_s = 1$, $f_A(q_A,q_B)$ can be
rewritten in terms of prey density in the more familiar form

$$\frac{x q_A}{1 + x q_A\tau_{Ah} + y q_B\tau_{Bh}}.$$

As mentioned above, it is assumed that the
encounter rates, $p_A$ and $p_B$, remain unchanged over the renewal
cycle in that predation has negligible effect on prey densities
during this time. This occurs if, for example, $x$ and $y$ are large or
$N$ is quite large and so predation is rare. Our decision tree
approach provides a mechanistic foundation to typical functional
responses assumed in the literature. In particular, it is obvious that
the standard Holling II functional response [2] given by

$$f(x) = \frac{x}{1 + x\tau_h}$$

is the outcome for Figure 1 when there is only
one type of prey and the predator always pursues every prey it
encounters (take $N\tau_s = 1$, $y = 0$ and $q_A = 1$).
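
The renewal-cycle calculation can be checked numerically. The following Python sketch is an illustration and not part of the original analysis: it lists the five leaves of the decision tree in Figure 1 and divides the expected number of kills per cycle by the expected cycle duration. The function names `tree_leaves` and `functional_responses` and all numerical values are assumptions made for this example; exact fractions are used so that the result can be compared with the closed form.

```python
from fractions import Fraction as F

def tree_leaves(x, y, N, qA, qB, tau_s, tau_Ah, tau_Bh):
    """Leaves of the Figure 1 tree as (probability, duration, A killed, B killed)."""
    pA, pB = F(x, N), F(y, N)        # encounter probabilities for large N
    p0 = 1 - pA - pB                 # probability of an empty microhabitat
    return [
        (pA * qA,       tau_s + tau_Ah, 1, 0),  # meet A and attack
        (pA * (1 - qA), tau_s,          0, 0),  # meet A and do not attack
        (pB * qB,       tau_s + tau_Bh, 0, 1),  # meet B and attack
        (pB * (1 - qB), tau_s,          0, 0),  # meet B and do not attack
        (p0,            tau_s,          0, 0),  # empty microhabitat
    ]

def functional_responses(leaves):
    """Renewal-theory ratio: mean kills per cycle / mean cycle duration."""
    mean_time = sum(p * t for p, t, _, _ in leaves)
    fA = sum(p * a for p, _, a, _ in leaves) / mean_time
    fB = sum(p * b for p, _, _, b in leaves) / mean_time
    return fA, fB

# Hypothetical values, with searching time normalized so that N * tau_s = 1.
x, y, N = 20, 10, 1000
qA, qB = F(1), F(1, 2)
tau_s, tau_Ah, tau_Bh = F(1, N), F(2), F(3)

fA, fB = functional_responses(tree_leaves(x, y, N, qA, qB, tau_s, tau_Ah, tau_Bh))
closed_form_fA = x * qA / (1 + x * qA * tau_Ah + y * qB * tau_Bh)
print(fA, closed_form_fA)
```

Setting `y = 0` and `qA = 1` in the same functions gives the single-prey Holling type II value $x/(1+x\tau_h)$, which is one way to sanity-check the tree.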

The predator's rate of energy gain, $f$, is given by (Figure 1)

$$f(q_A,q_B) = \frac{p_A q_A\pi_A + p_B q_B\pi_B}{\tau_s + p_A q_A\tau_{Ah} + p_B q_B\tau_{Bh}}. \tag{1}$$



Like others [9,20,23,26], we assume that the forager aims to
maximize $f$. This theory predicts that if the two types of prey are
ranked according to their "profitabilities" (i.e. their respective
nutritional values per unit of handling time $\pi/\tau$), then the more
profitable prey type is always included in the diet. That is, if
$\pi_A/\tau_{Ah} > \pi_B/\tau_{Bh}$, then the optimal foraging strategy is to attack all
encountered prey A (i.e. $q_A = 1$). Furthermore, the decision to
attack the lower ranked prey (i.e. prey B) satisfies the zero-one rule.
Specifically, $q_B = 1$ (respectively, $q_B = 0$) if its profitability is greater
than (respectively, less than) the rate of energy gain from attacking only
prey of type A, i.e. $q_B = 1$ if and only if

$$\frac{\pi_B}{\tau_{Bh}} > \frac{p_A\pi_A}{\tau_s + p_A\tau_{Ah}}.$$

The threshold value for including the less profitable prey in the
predator's diet depends only on the chances of encountering the
more profitable prey (i.e. only on the density of prey A) since
$q_B = 1$ if and only if $p_A < p_A^*$ where

$$p_A^* = \frac{\pi_B\tau_s}{\pi_A\tau_{Bh} - \pi_B\tau_{Ah}} \tag{2}$$

[9,20,23,26].
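
As a hedged illustration of the zero-one rule, the Python sketch below evaluates the rate of energy gain (1) on a coarse grid of strategies and compares the grid maximizer with the threshold (2). The helper names (`energy_rate`, `threshold_pA`, `best_on_grid`) and the parameter values are assumptions chosen for this example only; they do not come from the original study.

```python
from fractions import Fraction as F

def energy_rate(qA, qB, pA, pB, piA, piB, tau_s, tau_Ah, tau_Bh):
    """Rate of energy gain, equation (1)."""
    gain = pA * qA * piA + pB * qB * piB
    time = tau_s + pA * qA * tau_Ah + pB * qB * tau_Bh
    return gain / time

def threshold_pA(piA, piB, tau_s, tau_Ah, tau_Bh):
    """Switching threshold, equation (2); assumes piA/tau_Ah > piB/tau_Bh."""
    return piB * tau_s / (piA * tau_Bh - piB * tau_Ah)

def best_on_grid(pA, pB, steps=4, **par):
    """Brute-force maximizer of the energy rate over a grid of (qA, qB)."""
    grid = [F(i, steps) for i in range(steps + 1)]
    return max(((qA, qB) for qA in grid for qB in grid),
               key=lambda q: energy_rate(q[0], q[1], pA, pB, **par))

# Hypothetical parameter values chosen for illustration only.
par = dict(piA=F(2), piB=F(1), tau_s=F(1), tau_Ah=F(1), tau_Bh=F(2))
pA_star = threshold_pA(**par)
above = best_on_grid(F(1, 2), F(1, 4), **par)   # pA above the threshold
below = best_on_grid(F(1, 4), F(1, 4), **par)   # pA below the threshold
print(pA_star, above, below)
```

With these values the threshold is $p_A^* = 1/3$: the grid maximizer excludes prey B when $p_A = 1/2$ and includes it when $p_A = 1/4$, in line with the rule stated above. Exactly at $p_A = p_A^*$ the rate with $q_A = 1$ no longer depends on $q_B$.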



Decision trees and extensive form games 

The decision tree approach is reminiscent of games given in 
extensive form [18,19]. Because of this relationship between 
decision trees and extensive form games, game theory can then be 
used to find the optimal foraging strategy. First, we use the 
truncation method to eliminate those paths that always yield 
suboptimal outcomes. When applied to Figure 1, truncation 
removes the dotted path of rejecting the opportunity to capture 
prey type A. It is never optimal to reject the prey that offers a 
higher reward to handling time ratio. But what of node B? For 
food B with a lower energy to handling time ratio, we can find the 
optimal foraging strategy by analyzing the agent normal form 
[19]. This method assigns a separate player (called an agent) to 
each decision node. The possible decisions at this node become the 
agent's strategies and its payoff is given by the total energy intake 
rate of the predator it represents. When game theory is used to 
solve a single predator's decision tree, all of the virtual agents have 
the same common payoff, and in a sense, these agents engage in a 
cooperative game. The optimal foraging strategy of the single 
predator is then a solution to this game. 

To illustrate the approach, we make the decision tree of Figure 1
into a two-player foraging game. Player 1 corresponds to decision
node A with strategy set $S_1 = \{q_A \mid 0 \le q_A \le 1\}$ and player 2 to node
B with strategy set $S_2 = \{q_B \mid 0 \le q_B \le 1\}$. Their common payoff
$f(q_A,q_B)$ is given by (1). In an extensive form game, the payoff
functions are linear in the behavioral strategy choices of all players.
For our optimal foraging games, these payoffs are nonlinear
functions and so are more similar to those found in population
games [27,28]. As a game, we seek the Nash equilibrium (NE).
This is a pair of behavioral strategies $(q_A^*,q_B^*)$, one for each player,
such that neither player can gain by unilaterally changing its
strategy. That is,

$$f(q_A,q_B^*) \le f(q_A^*,q_B^*) \quad\text{and}\quad f(q_A^*,q_B) \le f(q_A^*,q_B^*) \tag{3}$$

for all $q_A \in S_1$ and $q_B \in S_2$. In game-theoretic terms, $(q_A^*,q_B^*)$ is a NE
if $q_A^*$ is a best response of player 1 to $q_B^*$ and $q_B^*$ is a best response of
player 2 to $q_A^*$.

Clearly, an optimal foraging behavior $(q_A^*,q_B^*)$ (i.e. one with
$f(q_A,q_B) \le f(q_A^*,q_B^*)$ for all $q_A \in S_1$ and $q_B \in S_2$) corresponds to a NE since it
satisfies (3). Solving the game (i.e. finding the NE) for the classic
foraging model of two types of prey is straightforward. Since
$f(1,q_B) > f(q_A,q_B)$ for all $0 \le q_A < 1$ and $0 \le q_B \le 1$, the behavioral
strategy of player 1 to attack (i.e. $q_A = 1$) strictly dominates all its
other options (i.e. $q_A < 1$) and so, at any NE, player 1 must play
$q_A^* = 1$. The NE strategy of player 2 is then any best response to
$q_A^* = 1$ (i.e. any $q_B^*$ that satisfies $f(1,q_B) \le f(1,q_B^*)$ for all $0 \le q_B \le 1$).
A short calculation yields

$$q_B^* = \begin{cases} 0 & \text{if } p_A > p_A^* \\ 1 & \text{if } p_A < p_A^* \\ [0,1] & \text{if } p_A = p_A^*, \end{cases} \tag{4}$$

where $p_A^*$ is given by (2). These results are shown in Figure 2 where
NE are indicated by solid circles (panels (a) and (c)) and by the solid
line segment on the right edge of panel (b). In this latter case (i.e.
when $p_A = p_A^*$), every point on this vertical edge $q_A = 1$ is a NE and
the entire edge forms a NE component (i.e. a maximal connected
set of NE, cf. [19]). Thus, at this critical encounter rate with the
more profitable prey type, the zero-one rule of optimal foraging,
which states that a given resource type in a given patch is either
always consumed when encountered or never consumed, must be
modified because the optimally foraging predator's preference for
the alternative prey type can be anywhere between 0 and 1.

Since Figure 1 is a two-level foraging game, Theorem 3 of
section Zero-one rule and the Nash equilibrium of Appendix S1
implies that the NE given by Figure 2 (i.e. by $q_A^* = 1$ and $q_B^*$ given
by (4)) completely characterize optimal predator foraging behavior.
Figure 2 also indicates the direction of increasing energy intake
per unit time at points in the unit square. This suggests yet another
connection to game theory; namely, how does the predator learn
its optimal behavior? This question is commonly studied in
evolutionary game theory [19,29] where individual behaviors
evolve in such a way that strategies with higher payoff become
used more frequently. By following the flow of increasing payoff in
the figure, it is clear from Figure 2 that such an evolutionary
process will automatically lead to optimal predator behavior. We
will return to this question in section Game theory and
evolutionary outcomes for the prey recognition game where the
evolutionary outcome is not so clear.

In more general games where the decision tree has more
than 2 levels, there may be NE that do not correspond to optimal
foraging behavior. However, so long as the number of encounter
events at level 1 and predator activities remain finite, these
decision trees generate the predator's energy intake rate and its
functional responses on each type of prey. Game-theoretic
equilibrium selection techniques [30] based on evolutionary
outcomes can then be used to discard suboptimal NE behaviors
and select only those NE corresponding to optimal foraging
behaviors, as we will see in the final example that includes prey
recognition effects (see section Prey Recognition Effects).

Results 

Foraging with simultaneous resource encounters 

In this section, we again assume that there are two resource
types (denoted as A and B) but, unlike section Decision trees and
the functional response for two prey types, some microhabitats can
contain a mixture of both types (denoted as AB). In this case, we
assume that the consumer can forage for at most one resource type
in any encounter event. Other microhabitats can be resource-free.
Furthermore, let $p_A$, $p_B$ and $p_{AB}$ be the proportions of
these microhabitats that contain only resource A, only resource B
and both resources AB, respectively. Finally, let $p_0$ be the
proportion of microhabitats that contain no resources. If the
consumer chooses a patch at random, the distribution of
encounter events is given by Level 1 of Figure 3.

Figure 3 also contains the distribution of consumer activity
events under the assumption that the consumer is always successful
when it decides to forage a resource that it encounters. In the
predator-prey interpretation, this means the predator kills its prey
whenever it attacks. As discussed in the final paragraph of section
Decision trees and the functional responses of Appendix S1, our
decision tree approach to optimal foraging is also applicable when
the attacking predator is only successful with a certain probability
that may depend on the type of prey. Here $q_A$ (respectively, $q_B$) is
the probability the consumer forages for the resource when it
encounters only resource type A (respectively, type B). Also $q_{AB}$
(respectively, $q_{BA}$) is the probability the consumer forages type A
(respectively, type B) resource when it chooses a microhabitat that
contains both types of resources and so $1-q_{AB}-q_{BA}$ is the
probability the consumer decides not to forage for either resource
in this encounter event.



[Figure 2 image: three panels, (a) $p_A^* < p_A$, (b) $p_A^* = p_A$ and (c) $p_A^* > p_A$, each showing the unit square with axes $q_A$ (horizontal) and $q_B$ (vertical).]

Figure 2. Qualitative outcomes of the optimal foraging strategy for the classical foraging model (1) with two prey types as a
function of the encounter probability with the most profitable prey (i.e. of $p_A$). Panel (a) assumes that
$1 > p_A > p_A^* = \pi_B\tau_s/(\pi_A\tau_{Bh} - \pi_B\tau_{Ah})$, in
which case the optimal strategy and NE is $(q_A,q_B) = (1,0)$. In panel (c), $p_A < p_A^*$ and the optimal strategy (and NE) is $(q_A,q_B) = (1,1)$. The arrows in each
panel indicate the direction of increasing energy intake per unit time at points in the unit square. For completeness, the figure also includes the
threshold case, panel (b), where $p_A = p_A^*$ (i.e. the density of A prey is at the switching threshold). Although this case is rarely considered by ecologists,
its inclusion here is important to understand the optimal outcomes in our more complicated models. In panel (b), the optimal strategy is
$(q_A,q_B) = (1,v)$ where $0 \le v \le 1$, corresponding to the solid right-hand edge of the unit square that forms a set of NE points.
doi:10.1371/journal.pone.0088773.g002
[Figure 3 image: decision tree for the simultaneous encounter game. Recoverable labels at node AB: branch probabilities $q_{BA}$ and $1-q_{AB}-q_{BA}$; energy gains $\pi_A$, $\pi_B$, $0$; durations $\tau_s+\tau_{Ah}$, $\tau_s+\tau_{Bh}$, $\tau_s$.]

Figure 3. The decision tree for the simultaneous encounter game. At optimal foraging, two edges of this tree diagram are never followed.
These are indicated by dotted lines in the tree. The reduced tree is then the resulting diagram with these edges removed.
doi:10.1371/journal.pone.0088773.g003



The functional response can then be developed from the
decision tree in Figure 3 that includes the searching and handling
times as well as the energy intakes of the different activity events.
Proceeding as in section Decision trees and functional response for
two prey types, the functional responses to resource type A and B
are given by

$$f_A(q_A,q_{AB},q_{BA},q_B) = \frac{p_A q_A + p_{AB} q_{AB}}{\tau}, \qquad f_B(q_A,q_{AB},q_{BA},q_B) = \frac{p_{AB} q_{BA} + p_B q_B}{\tau} \tag{5}$$

respectively, where $\tau = \tau_s + p_A q_A\tau_{Ah} + p_{AB} q_{AB}\tau_{Ah} + p_{AB} q_{BA}\tau_{Bh} + p_B q_B\tau_{Bh}$.
Thus the total consumer energy intake per unit time is

$$f(q_A,q_{AB},q_{BA},q_B) = \frac{(p_A q_A + p_{AB} q_{AB})\pi_A + (p_{AB} q_{BA} + p_B q_B)\pi_B}{\tau}. \tag{6}$$


To find the optimal foraging strategy, we solve for the NE of the
three-player game that assigns one player to each of the consumer
decision nodes in Figure 3. As shown in section Foraging with
simultaneous resource encounters of Appendix S1, the behavior
strategy to consume resource A at node A strictly dominates all
other actions of this player (i.e., $f(1,q_{AB},q_{BA},q_B) > f(q_A,q_{AB},q_{BA},q_B)$
for all $0 \le q_A < 1$), as we assume that resource
A is more profitable to the predator than resource B (i.e. that
$\pi_A/\tau_{Ah} > \pi_B/\tau_{Bh}$). It is also shown there that any behavior strategy at
node AB whereby a resource is not always consumed (i.e.
$q_{AB}+q_{BA} < 1$) is strictly dominated. Thus $q_A = 1$ and
$q_{AB}+q_{BA} = 1$ at any NE.

From these two results, the decision tree in Figure 3 can be
truncated by deleting the two edges indicated by dotted lines. With
this change, the consumer energy intake rate $f$ becomes

$$f(q_{AB},q_B) = \frac{(p_A + p_{AB} q_{AB})\pi_A + (p_{AB}(1-q_{AB}) + p_B q_B)\pi_B}{\tau} \tag{7}$$

where now $\tau = \tau_s + p_A\tau_{Ah} + p_{AB} q_{AB}\tau_{Ah} + p_{AB}(1-q_{AB})\tau_{Bh} + p_B q_B\tau_{Bh}$.

Thus, the optimal strategy is a NE of the two-player game
corresponding to the reduced tree of Figure 3. In this two-level
foraging game, player 1 corresponds to decision node AB with
strategy $0 \le q_{AB} \le 1$ and player 2 to node B with strategy
$0 \le q_B \le 1$. Their common payoff is given by (7). From section
Foraging with simultaneous resource encounters of Appendix S1,
the best response for player 1 that encounters both prey types
simultaneously given the current strategy of player 2 is

$$q_{AB} = \begin{cases} 0 & \text{if } q_B < q_B^* \\ 1 & \text{if } q_B > q_B^* \\ [0,1] & \text{if } q_B = q_B^* \end{cases} \tag{8}$$

where

$$q_B^* = \frac{(\pi_B - \pi_A)\tau_s - (p_A + p_{AB})(\pi_A\tau_{Bh} - \pi_B\tau_{Ah})}{p_B(\pi_A\tau_{Bh} - \pi_B\tau_{Ah})}. \tag{9}$$

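Similarly, the best response of player 2 when encountering only
resource B is

$$q_B = \begin{cases} 1 & \text{if } q_{AB} < q_{AB}^* \\ 0 & \text{if } q_{AB} > q_{AB}^* \\ [0,1] & \text{if } q_{AB} = q_{AB}^* \end{cases} \tag{10}$$

where

$$q_{AB}^* = \frac{\pi_B\tau_s - p_A(\pi_A\tau_{Bh} - \pi_B\tau_{Ah})}{p_{AB}(\pi_A\tau_{Bh} - \pi_B\tau_{Ah})}. \tag{11}$$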



Then $(q_{AB},q_B)$ is a NE if and only if this strategy pair satisfies
equations (8) and (10). Thus, unlike section Decision trees and
functional response for two prey types, the NE behavior at one
consumer decision node depends on the behavior at the other.
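
The interplay between the two best responses can be explored numerically. The Python sketch below is an illustration under stated assumptions, not the authors' procedure: it evaluates the energy intake rate (7) and the thresholds (9) and (11), and then searches the four vertices of the unit square for strategy pairs that no single player can improve. Because (7) is a ratio of two expressions that are each linear in $q_{AB}$ and in $q_B$ with a positive denominator, it is monotone in each variable separately, so comparing a vertex with its two neighbours is enough to test the NE condition there. The function names are hypothetical; the parameter values are those reported in the Figure 4 caption, and $\pi_B = 1$ is an arbitrary choice.

```python
from fractions import Fraction as F
from itertools import product

def rate(qAB, qB, pA, pAB, pB, piA, piB, tau_s, tau_Ah, tau_Bh):
    """Energy intake rate on the reduced tree, equation (7)."""
    gain = (pA + pAB * qAB) * piA + (pAB * (1 - qAB) + pB * qB) * piB
    time = (tau_s + pA * tau_Ah + pAB * qAB * tau_Ah
            + pAB * (1 - qAB) * tau_Bh + pB * qB * tau_Bh)
    return gain / time

def thresholds(pA, pAB, pB, piA, piB, tau_s, tau_Ah, tau_Bh):
    """Return (qAB_star, qB_star) from equations (11) and (9)."""
    delta = piA * tau_Bh - piB * tau_Ah   # positive when A is more profitable
    qB_star = ((piB - piA) * tau_s - (pA + pAB) * delta) / (pB * delta)
    qAB_star = (piB * tau_s - pA * delta) / (pAB * delta)
    return qAB_star, qB_star

def vertex_equilibria(**par):
    """Vertices of the unit square where no unilateral switch raises the rate."""
    found = []
    for qAB, qB in product((0, 1), repeat=2):
        here = rate(qAB, qB, **par)
        if here >= rate(1 - qAB, qB, **par) and here >= rate(qAB, 1 - qB, **par):
            found.append((qAB, qB))
    return found

# Parameter values from the Figure 4 caption; piB = 1 is an arbitrary choice.
par = dict(pA=F(1, 4), pAB=F(1, 2), pB=F(1, 4), piA=F(2), piB=F(1),
           tau_s=F(1), tau_Ah=F(1), tau_Bh=F(2))
print(thresholds(**par), vertex_equilibria(**par))
```

Re-running `vertex_equilibria` with other values of `piB` (for example `dict(par, piB=F(2))`) shows how the equilibrium vertex moves as prey B becomes more valuable.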

By Theorem 3 in section Zero-one rule and the Nash
equilibrium of Appendix S1, NE correspond to optimal foraging
behavior. Thus, the optimal foraging behavior depends critically
on the values of $q_{AB}^*$ and $q_B^*$. In particular, it is important to know
whether these values are between 0 and 1, less than 0 or greater
than 1. For instance, suppose that $q_{AB}^* > 1$ and $0 < q_B^* < 1$. Then,
from (10), $q_B = 1$ (since $q_{AB} < q_{AB}^*$) and so $q_{AB} = 1$ by (8). In this
case, the only optimal foraging behavior is to consume A
whenever it is encountered and to consume B only when it is
not encountered simultaneously with A. In general, we observe
that (i) if $q_{AB}^* \le 0$, then $q_B^* < 0$, and (ii) if $q_B^* \ge 0$, then $q_{AB}^* > 1$.
These inequalities constrain the number of possible optimal
strategies to $(q_{AB},q_B) \in \{(1,0),(1,1),(0,1)\}$. These are the possible
optimal strategies among the vertices of the unit square in Figure 4.
As we will see, at certain threshold parameter values, two of these
vertices can both correspond to optimal behavior. In this case, all
points on the edge between these vertices correspond to optimal
behavior as well. In particular, the case where $q_{AB} = q_B = 0$ can
never occur because the two necessary conditions $q_{AB}^* \le 0$ and
$q_B^* \ge 0$ are excluded by (i) and (ii). This intuitive result predicts that
if the less profitable resource type B is consumed when
encountering both types (i.e., if $q_{BA} = 1$ which implies that
$q_{AB} = 0$), it will always be consumed when encountered alone.
Moreover, there is no interior optimal strategy (i.e. there is no NE
whereby the consumer exhibits partial diet choice when encountering
both resource types simultaneously as well as partial diet
choice when encountering resource B on its own) since this
requires that both $q_{AB}^*$ and $q_B^*$ be strictly between 0 and 1, which
does not happen for any parameter choice.

It is interesting to analyze dependence of the optimal strategy
$(q_{AB},q_B)$ on the energetic value ($\pi_B$) of the less profitable prey.
We consider only those energetic values for which B is the less
profitable prey type (i.e., $0 < \pi_B < \pi_A\tau_{Bh}/\tau_{Ah}$). To this end, we need to
know the critical values of $\pi_B$ when either $q_B^*$ or $q_{AB}^*$ are equal to 0
or 1. Let

$$\pi_{B1}^* = \frac{\pi_A((p_A+p_{AB}+p_B)\tau_{Bh} + \tau_s)}{(p_A+p_{AB}+p_B)\tau_{Ah} + \tau_s}, \qquad \pi_{B2}^* = \frac{\pi_A((p_A+p_{AB})\tau_{Bh} + \tau_s)}{(p_A+p_{AB})\tau_{Ah} + \tau_s},$$

$$\pi_{B3}^* = \frac{(p_A+p_{AB})\pi_A\tau_{Bh}}{(p_A+p_{AB})\tau_{Ah} + \tau_s}, \qquad \pi_{B4}^* = \frac{p_A\pi_A\tau_{Bh}}{p_A\tau_{Ah} + \tau_s}.$$

Then $q_B^*(\pi_{B1}^*) = q_{AB}^*(\pi_{B3}^*) = 1$ and $q_B^*(\pi_{B2}^*) = q_{AB}^*(\pi_{B4}^*) = 0$. We
divide the analysis into two cases. For the first, assume that the
handling time of resource A is longer than or equal to that of
resource B ($\tau_{Ah} \ge \tau_{Bh}$). As we assume that resource A is more
profitable than B, it follows that the energy content in food items
must be larger in resource A ($\pi_A > \pi_B$) and so $q_B^* < 0$ by (9). Thus,
the optimal foraging strategy is to consume the A resource when
both are encountered (i.e. $q_{AB} = 1$). Furthermore, $q_B = 0$ if
$0 < \pi_B < \pi_{B3}^*$ and $q_B = 1$ if $\pi_{B3}^* < \pi_B < \pi_A\tau_{Bh}/\tau_{Ah}$ (i.e. the B resource
is consumed when encountered on its own only if its energy value
is sufficiently high). The dependence of the optimal strategy as a
function of prey B energetic value is shown in Figure 5A.
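
For concreteness, the critical values can be evaluated with exact arithmetic. The Python sketch below is an illustration only; the function names `critical_values` and `optimal_strategy_long_A` are hypothetical. It uses the parameter values quoted in the captions of Figures 4 and 5, and the second function encodes only the first case discussed above ($\tau_{Ah} \ge \tau_{Bh}$), away from the critical value itself.

```python
from fractions import Fraction as F

def critical_values(pA, pAB, pB, piA, tau_s, tau_Ah, tau_Bh):
    """Critical energy values of prey B, returned as (piB1, piB2, piB3, piB4)."""
    P_all, P_A = pA + pAB + pB, pA + pAB
    piB1 = piA * (P_all * tau_Bh + tau_s) / (P_all * tau_Ah + tau_s)  # qB*  = 1
    piB2 = piA * (P_A * tau_Bh + tau_s) / (P_A * tau_Ah + tau_s)      # qB*  = 0
    piB3 = P_A * piA * tau_Bh / (P_A * tau_Ah + tau_s)                # qAB* = 1
    piB4 = pA * piA * tau_Bh / (pA * tau_Ah + tau_s)                  # qAB* = 0
    return piB1, piB2, piB3, piB4

def optimal_strategy_long_A(piB, **par):
    """Optimal (qAB, qB) when tau_Ah >= tau_Bh and piB is not the critical value."""
    piB3 = critical_values(**par)[2]
    return (1, 0) if piB < piB3 else (1, 1)

# Parameter values quoted in the Figure 4 caption (tau_Ah < tau_Bh).
fig4 = dict(pA=F(1, 4), pAB=F(1, 2), pB=F(1, 4), piA=F(2),
            tau_s=F(1), tau_Ah=F(1), tau_Bh=F(2))
print(critical_values(**fig4))

# Parameter values quoted for panel A of Figure 5 (tau_Ah > tau_Bh).
fig5A = dict(fig4, tau_Ah=F(2), tau_Bh=F(1))
print(optimal_strategy_long_A(F(1, 2), **fig5A),
      optimal_strategy_long_A(F(4, 5), **fig5A))
```

With the Figure 4 parameters this gives $\pi_{B1}^* = 3$, $\pi_{B2}^* = 20/7$, $\pi_{B3}^* = 12/7$ and $\pi_{B4}^* = 4/5$.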

The more interesting case where prey A handling time is shorter
than prey B handling time ($\tau_{Ah} < \tau_{Bh}$, Figure 5B) is analyzed in



[Figure 4 image: five panels showing the unit square with axes $q_{AB}$ (horizontal) and $q_B$ (vertical): (a) $0 < \pi_B < \pi_{B4}^*$, (b) $\pi_{B4}^* < \pi_B < \pi_{B3}^*$, (c) $\pi_{B3}^* < \pi_B < \pi_{B2}^*$, (d) $\pi_{B2}^* < \pi_B < \pi_{B1}^*$, (e) $\pi_{B1}^* < \pi_B < \pi_A\tau_{Bh}/\tau_{Ah}$.]

Figure 4. All qualitative outcomes of the optimal foraging strategy (8) and (10) with parameters
$\tau_s = 1$, $\tau_{Ah} = 1$, $\tau_{Bh} = 2$, $\pi_A = 2$, $p_A = 1/4$, $p_{AB} = 1/2$, $p_B = 1/4$. In these plots, the energetic value $\pi_B$ of resource B varies in the interval from 0 to $\pi_A\tau_{Bh}/\tau_{Ah}$
(i.e. $0 < \pi_B < 4$). The critical values of $\pi_B$ are $\pi_{B1}^* = 3$; $\pi_{B2}^* = 20/7$; $\pi_{B3}^* = 12/7$; $\pi_{B4}^* = 4/5$. The arrows in each panel indicate the direction of increasing energy
intake per unit time at points in the unit square. In each case shown, these arrows lead to a single vertex indicated by the filled in circle which
corresponds to the optimal foraging behavior (and unique NE). (a) For $\pi_B < 4/5$, $q_{AB}^* < 0$ and $q_B^* < 0$. Thus $q_{AB} = 1$ and $q_B = 0$. (b) For $4/5 < \pi_B < 12/7$,
$0 < q_{AB}^* < 1$ and $q_B^* < 0$. Thus $q_{AB} = 1$ and $q_B = 0$. The dashed line denotes $q_{AB}^*$. As $\pi_B$ increases, $q_{AB}^*$ moves to the right until it coincides with the
vertical line $q_{AB} = 1$ when $\pi_B = \pi_{B3}^* = 12/7$. At this critical value of $\pi_B$ (not shown), all points $(1,q_B)$ on this vertical line are optimal foraging strategies
(and NE). (c) For $12/7 < \pi_B < 20/7$, $q_{AB}^* > 1$ and $q_B^* < 0$. Thus $q_{AB} = 1$ and $q_B = 1$. (d) For $20/7 < \pi_B < 3$, $q_{AB}^* > 1$ and $0 < q_B^* < 1$. Thus $q_{AB} = 1$ and $q_B = 1$. The
dashed line denotes $q_B^*$. (e) For $3 < \pi_B < 4$, $q_{AB}^* > 1$ and $q_B^* > 1$. Thus $q_{AB} = 0$ and $q_B = 1$.
doi:10.1371/journal.pone.0088773.g004
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1.0 
0.8 
0.6 
0.4 
0.2 
0.0 



Qab 



QB 



B 



" D3 



'Bl 



Figure 5. Dependence of the optimal foraging strategy qab (W, 
solid line) and qs ((10), dashed line) on the energy content of 
the less profitable prey type B. Panel A assumes a larger handling 

time of prey type A (t^/,=2, tb;, = 1), while panel B assumes the 
opposite case (t^a = 1, tb/,=2). Other parameters Tj = 1, 7c^=2, 

PA = l/'i,PAB = l/2,PB=1/4. 

doi:1 0.1 371/journal.pone.0088773.g005 

section Foraging with simultaneous resource encounters of 
Appendix S1. When energy content of prey B is smaller than 
$\pi_{B3}^*$ the optimal strategy is $(q_{AB},q_B) = (1,0)$. For intermediate 
energy content satisfying $\pi_{B3}^* < \pi_B < \pi_{B1}^*$, the optimal strategy is 
$(q_{AB},q_B) = (1,1)$. For relatively large energy content 
$\pi_{B1}^* < \pi_B < \pi_A\tau_{Bh}/\tau_{Ah}$, the optimal strategy is $(q_{AB},q_B) = (0,1)$. 

These results are also included in Figure 4 that in addition 
provides the direction of increasing energy intake per unit time at 
all points in the unit square. In all cases analyzed in the previous 
two paragraphs, the outcome satisfies the zero-one rule (i.e. either 
always consume a given resource type in a given patch or never 
consume it) as suggested by [31]. 

It is particularly interesting to see what happens at the critical 
values $\pi_{B3}^*$, where $q_{AB}^* = 1$, and $\pi_{B1}^*$, where $q_B^* = 1$. These values 
correspond to transitions (b)-(c) and (d)-(e) respectively in Figure 4 
because the dashed vertical (panel (b)) and horizontal (panel (d)) 
lines respectively are then on the boundary of the unit square. 
Straightforward calculations show that 

$$f(1,q_B) = \frac{(p_A+p_{AB})\pi_A}{(p_A+p_{AB})\tau_{Ah}+\tau_s}$$

when $\pi_B = \pi_{B3}^*$. Thus $f(1,q_B)$ is independent of $q_B$ and the optimal 
foraging behavior is any strategy pair of the form $(q_{AB},q_B) = (1,q_B)$ 
for $0 \le q_B \le 1$. 

Similarly, when $\pi_B = \pi_{B1}^*$, 

$$f(q_{AB},1) = \frac{(p_A+p_{AB}+p_B)\pi_A}{(p_A+p_{AB}+p_B)\tau_{Ah}+\tau_s}$$

and the optimal foraging behavior is any strategy pair of the form 
$(q_{AB},q_B) = (q_{AB},1)$ for $0 \le q_{AB} \le 1$. Once again, the zero-one rule 
must be modified at these critical values. For instance, when 
$\pi_B = \pi_{B1}^*$, resource B is always consumed under optimal foraging 
when encountered on its own. However, if both resources are 
encountered simultaneously, optimal foraging occurs for any 
preference for the less profitable prey type. In section Zero-one 
rule and the Nash equilibrium of Appendix S1, the modified zero-one 
rule states that there is at least one optimal foraging behavior 
where the corresponding NE is a pure strategy, i.e., where the 
predator preference for a prey is either 0 or 1. After such 
modification the zero-one rule holds even at $\pi_B = \pi_{B1}^*$ because the 
(pure) strategy $(q_{AB},q_B) = (1,1)$ is optimal. This extension of the 
zero-one rule applies to situations where optimal preferences for 
prey types as a function of a parameter switch suddenly at some 
critical values from 1 to 0 or vice versa. 
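
The indifference at a critical value, and the switching of the optimal vertex, can be checked numerically. The sketch below is illustrative only. It assumes that the energy intake rate is the expected energy gained per encounter cycle divided by the expected cycle duration, with gain $p_A\pi_A + p_{AB}(q_{AB}\pi_A + (1-q_{AB})\pi_B) + p_B q_B\pi_B$ and duration $\tau_s + p_A\tau_{Ah} + p_{AB}(q_{AB}\tau_{Ah} + (1-q_{AB})\tau_{Bh}) + p_B q_B\tau_{Bh}$; this form is consistent with the two expressions above but is not restated in this section. The names `intake_rate` and `best_pure_strategies` are hypothetical helpers, and the parameter values are those quoted for Figure 4.

```python
from fractions import Fraction as F

# Parameter values quoted in the Figure 4 caption (assumed here).
TAU_S, TAU_AH, TAU_BH = F(1), F(1), F(2)
PI_A = F(2)
P_A, P_AB, P_B = F(1, 4), F(1, 2), F(1, 4)


def intake_rate(q_ab, q_b, pi_b):
    """Energy intake per unit time (assumed form of the functional response)."""
    gain = P_A * PI_A + P_AB * (q_ab * PI_A + (1 - q_ab) * pi_b) + P_B * q_b * pi_b
    time = (TAU_S + P_A * TAU_AH
            + P_AB * (q_ab * TAU_AH + (1 - q_ab) * TAU_BH)
            + P_B * q_b * TAU_BH)
    return gain / time


def best_pure_strategies(pi_b):
    """Vertices (q_AB, q_B) of the unit square with the highest intake rate."""
    vertices = [(a, b) for a in (0, 1) for b in (0, 1)]
    best = max(intake_rate(a, b, pi_b) for a, b in vertices)
    return [v for v in vertices if intake_rate(*v, pi_b) == best]


# Energy content of B at which f(1, q_B) stops depending on q_B: the
# profitability of B then equals the rate obtained from resource A alone.
rate_a_only = (P_A + P_AB) * PI_A / ((P_A + P_AB) * TAU_AH + TAU_S)
pi_b3 = rate_a_only * TAU_BH

for q_b in (F(0), F(1, 3), F(1)):
    print(q_b, intake_rate(1, q_b, pi_b3))   # the same rate for every q_B

print(best_pure_strategies(F(1)))      # low energy content of B
print(best_pure_strategies(F(5, 2)))   # intermediate energy content
print(best_pure_strategies(F(7, 2)))   # high energy content
```

Exact rational arithmetic is used so that the equality of intake rates along the edge $q_{AB} = 1$ is tested without rounding error.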

These results can be partially explained through the patch 
choice model of [31]. Specifically, since patches A and AB have 
the same maximum profitability $\pi_A/\tau_{Ah}$ (which is higher than in 
patch B), both are included in the consumer's diet. However, as 
shown by [31], this does not mean that the most profitable 
resource is chosen in patch AB. From (9), we see that whether $q_{AB} = 0$ 
or 1 depends both on the ranking of A and B profitabilities (the 
denominator in (9)) as well as on the difference in energy gain 
$\pi_B - \pi_A$ per unit consumed. When resource type A is both more 
profitable and also has a higher energetic value ($\pi_A > \pi_B$), or 
search time is short, then $q_B^* < 0$ and, consequently, $q_{AB} = 1$, i.e., 
only resource A will be consumed in patches containing both 
resource types. Only when type B is energetically more valuable 
than type A and either search time is long enough, or the 
probability of encountering patch A and patch AB is low enough, 
can resource type B be selected when both resources are 
encountered simultaneously. In this case, resource B will also be 
consumed when encountered on its own. 

Prey recognition effects 

The functional response developed in section Decision trees and 
functional response for two prey types assumes the predator 
immediately recognizes the type of prey found during its search 
and then decides whether or not to attack it. In this section, we 
model the situation where the predator cannot distinguish the type 
of prey it encounters unless it is willing to spend extra 
"recognition" time $\tau_r$ beyond the time required to search the 
microhabitat. That is, the predator has an option of paying this 
extra cost to gain information on the prey type encountered before 
it decides whether to attack. This information is said to be 
gathered in the facultative sense [32]. Kotler and Mitchell [32] 
point out instances of facultative information that occur in 
host-parasite and in mate selection models. They also discuss optimal 
foraging when information is gathered in the obligate sense (i.e. 
the predator gathers this information on prey type in the process of 
handling the prey and has the option of rejecting it at that point). 
Although we do not consider the model for obligate information in 
this paper, its decision tree (and analysis of optimal foraging) is 
simpler than Figure 6 for facultative information. 

[Figure 6: decision tree with three levels. At level 2 the predator attacks immediately, moves on (probability $q_M$; gain 0, time $\tau_s$) or spends recognition time. At level 3 a recognized prey $i$ is attacked with probability $q_{Ai}$ (gain $\pi_i$, time $\tau_s + \tau_{ih} + \tau_{ir}$) or rejected with probability $1 - q_{Ai}$ (gain 0, time $\tau_s + \tau_{ir}$). Image not reproduced.] 

Figure 6. Decision tree for prey recognition game. In the reduced tree, the dotted edges are deleted. 
doi:10.1371/journal.pone.0088773.g006 



As in section Decision trees and functional response for two prey 
types, we assume that the two prey types are distributed among $N$ 
microhabitats with at most one prey in each. To ease notational 
difficulties, we now label these prey types as species 1 and 2 with 
densities $x_1$ and $x_2$ respectively and nutritional values $\pi_1$ and $\pi_2$ 
respectively to the predator. If the predator chooses a microhabitat 
at random, the encounter event distribution (see Figure 6) is the 
same as in Figure 1 (with our change of notation). In particular, 
$p_0 = (N - x_1 - x_2)/N$, $p_1 = x_1/N$ and $p_2 = x_2/N$. 

On finding a prey in a microhabitat, the predator decides 
immediately whether to attack, move to another microhabitat to 
begin a new search, or spend recognition time to determine the 
type of prey encountered. Suppose these choices are taken with 
probabilities $q_A$, $q_M$, $q_R$ respectively (where $q_A + q_M + q_R = 1$). 
The horizontal dashed line in Figure 6 joining these two encounter 
events indicates that this decision must be made without knowing 
the type of prey. Thus, in the terminology of extensive form games 
[19], the set of these two nodes forms an "information set" of the 
predator and is represented by a single player in the three-player 
game corresponding to Figure 6. We remark that the two nodes 
that form this single information set require only one player 
because, at both nodes, the information available is the same (the 
information is that the searching predator encountered a prey). 

If the predator decides to spend recognition time and determines that 
the encountered prey is of type $i$, then it must subsequently decide 
whether to attack this prey or not with probabilities $q_{Ai}$ and $1 - q_{Ai}$ 
respectively (see the third level of Figure 6). It is not necessary that 
$q_{A1} = q_{A2}$. If we assume that the predator is always successful when 
attacking a prey, the tree diagram is given in Figure 6 where $\tau_{1h}$ 
and $\tau_{2h}$ are the handling times for prey of type 1 and 2 
respectively. We also assume that the time needed to recognize 
either type of prey is the same (i.e. $\tau_{1r} = \tau_{2r} = \tau_r$). Proceeding as in 
section Decision trees and functional response for two prey types, 
the functional response to prey type $i$ is given by 

$$f_i((q_A,q_R,q_M),q_{A1},q_{A2}) = \frac{p_i(q_A + q_R q_{Ai})}{\tau}$$

where $\tau = \tau_s + p_1 q_A\tau_{1h} + p_1 q_R(q_{A1}\tau_{1h} + \tau_r) + p_2 q_A\tau_{2h} + p_2 q_R(q_{A2}\tau_{2h} + \tau_r)$. 
Thus the total predator nutritional value per unit time is 

$$f((q_A,q_R,q_M),q_{A1},q_{A2}) = \frac{p_1\pi_1(q_A + q_R q_{A1}) + p_2\pi_2(q_A + q_R q_{A2})}{\tau}$$

for fixed prey distribution $p_1$ and $p_2$. 

The optimal predator foraging behavior corresponds to the 
maximum of $f$ as a function of $q_A$, $q_M$, $q_R$, $q_{A1}$, $q_{A2}$. This 
maximum is considerably harder to determine than in section 
Decision trees and functional response for two prey types. 
However, game-theoretic methods to solve for NE are effective 
at simplifying the analysis. Figure 6 is a three-player foraging game 
with player 1 representing the predator decision at the two-node 
information set at level 2 and players 2 and 3 assigned to the 
respective decision nodes at level 3. From section The Nash 
equilibria of the prey recognition game of Appendix S1, any 
strategy $(q_A,q_R,q_M)$ of player 1 with $q_M > 0$ is strictly dominated 
and so any NE behavior of this player must satisfy $q_M = 0$. Thus, 
at the optimal strategy, the predator should never move to another 
microhabitat when it first finds a prey since, by abandoning this 
prey, the predator wastes the time spent searching for it. In 
contrast to section Decision trees and functional response for two 
prey types where the predator could reject the prey type with low 
profitability on first encounter, this is not possible here without 
rejecting the better prey type as well (because upon an initial 
encounter the predator does not know the prey type). 
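
The domination of strategies with $q_M > 0$ can be illustrated numerically. The following sketch is an illustration under stated assumptions, not the derivation of Appendix S1. It takes the total intake rate to be the expected gain $p_1\pi_1(q_A + q_R q_{A1}) + p_2\pi_2(q_A + q_R q_{A2})$ divided by the expected cycle duration $\tau_s + p_1 q_A\tau_{1h} + p_1 q_R(q_{A1}\tau_{1h} + \tau_r) + p_2 q_A\tau_{2h} + p_2 q_R(q_{A2}\tau_{2h} + \tau_r)$. The parameter values are those quoted for Figure 7, the recognition time 1/10 is an arbitrary choice, and `intake_rate` and `drop_moving` are hypothetical helper names.

```python
from fractions import Fraction as F

# Illustrative parameters (Figure 7 values; TAU_R is an assumed value).
TAU_S, TAU_1H, TAU_2H, TAU_R = F(1), F(1), F(2), F(1, 10)
PI_1, PI_2 = F(2), F(1)
P_1, P_2 = F(1, 2), F(1, 2)


def intake_rate(q_a, q_r, q_m, q_a1, q_a2):
    """Total energy intake per unit time for the three-level decision tree."""
    assert q_a + q_r + q_m == 1, "first-level probabilities must sum to 1"
    gain = P_1 * PI_1 * (q_a + q_r * q_a1) + P_2 * PI_2 * (q_a + q_r * q_a2)
    time = (TAU_S
            + P_1 * q_a * TAU_1H + P_1 * q_r * (q_a1 * TAU_1H + TAU_R)
            + P_2 * q_a * TAU_2H + P_2 * q_r * (q_a2 * TAU_2H + TAU_R))
    return gain / time


def drop_moving(q_a, q_r, q_m):
    """Rescale (q_A, q_R) so that the probability of moving on becomes zero."""
    keep = q_a + q_r          # assumed positive
    return q_a / keep, q_r / keep, F(0)


with_moving = (F(2, 5), F(2, 5), F(1, 5))
without_moving = drop_moving(*with_moving)

# Compare the two first-level strategies against several third-level choices.
for q_a1, q_a2 in [(1, 0), (1, 1), (F(1, 2), F(1, 2))]:
    print(intake_rate(*with_moving, q_a1, q_a2),
          intake_rate(*without_moving, q_a1, q_a2))
```

Rescaling keeps the ratio of attacking to recognizing fixed, so the gain and all handling and recognition times grow by the same factor while the search time $\tau_s$ per cycle does not; this is why the rescaled strategy yields the higher rate whenever the gain is positive.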

Since player 1 has a strategy of the form $(q_A,q_R,q_M) = (q_A, 1 - q_A, 0)$ 
for some $0 \le q_A \le 1$, we will denote the NE behavior 
of player 1 by $q_A$ and assume that $q_M = 0$ from now on. Section 
The Nash equilibria of the prey recognition game of Appendix S1 
also shows that, if prey type 1 is more profitable than type 2 (i.e. if 
$\pi_1/\tau_{1h} > \pi_2/\tau_{2h}$ as in section Decision trees and functional response for 
two prey types), then the predator must attack any prey 1 that it 
recognizes. We will assume this throughout this section. Thus, we 
will also assume that $q_{A1} = 1$ in the decision tree of Figure 6 and 
analyze the truncated foraging game that eliminates the three 
edges indicated by dotted lines there. 

The reduced tree corresponds to a two-player game with 
strategy set $0 \le q_A \le 1$ for player 1 at level 2 and $0 \le q_{A2} \le 1$ for 
player 2 representing the predator decision whether to attack a 
recognized prey 2 at level 3. The energy intake rate is then 

$$f(q_A,q_{A2}) = \frac{p_1\pi_1 + p_2\pi_2(q_A + (1 - q_A)q_{A2})}{\tau} \tag{12}$$

where 

$$\tau = \tau_s + p_1\tau_{1h} + p_2\tau_{2h}(q_A + (1 - q_A)q_{A2}) + (p_1 + p_2)\tau_r(1 - q_A).$$

[Figure 7: four panels (a)-(d), each showing the unit square of strategies with axes $q_A$ and $q_{A2}$; image not reproduced.] 

Figure 7. Qualitative outcomes of the optimal foraging strategy (13) and (14) for increasing recognition time $\tau_r$. Panel (a) assumes 
$0 < \tau_r < 1/6$ for which $0 < q_{A2}^* < 1$ and $q_A^* < 0$. The optimal foraging strategy is at $(q_A,q_{A2}) = (0,0)$ (i.e. always pay the cost of recognition and then never 
attack the less profitable prey type) and the NE component (shown as the gray line segment) $\{(1,q_{A2}) \mid q_{A2}^* \le q_{A2} \le 1\}$ (corresponding to the NE 
outcome of attacking immediately) is suboptimal. In each of the other three panels, the (union of the) thick edges forms a strict equilibrium set (SES, 
for definition see section Zero-one rule and the Nash equilibrium of Appendix S1) that is the globally stable evolutionary outcome. Panel (b) assumes 
$\tau_r = 1/6$, $q_{A2}^* = 0$ and $q_A^* < 0$. The union of the two edges $\{(1,q_{A2}) \mid 0 \le q_{A2} \le 1\}$ and $\{(q_A,0) \mid 0 \le q_A \le 1\}$ forms one NE component corresponding to 
optimal foraging behavior. Panel (c) assumes $1/6 < \tau_r < 1/2$, $q_{A2}^* < 0$ and $q_A^* < 0$. The edge $\{(1,q_{A2}) \mid 0 \le q_{A2} \le 1\}$ forms a NE component corresponding to 
optimal foraging behavior. Panel (d) assumes $\tau_r > 1/2$ for which $q_{A2}^* < 0$ and $0 < q_A^* < 1$. The edge $\{(1,q_{A2}) \mid 0 \le q_{A2} \le 1\}$ forms a NE component 
corresponding to optimal foraging behavior. The arrows in each panel indicate the direction of increasing energy intake per unit time at points in the 
unit square. Other parameters $\tau_s = 1$, $\tau_{1h} = 1$, $\tau_{2h} = 2$, $\pi_1 = 2$, $\pi_2 = 1$ and $p_1 = 1/2 = p_2$. 
doi:10.1371/journal.pone.0088773.g007 

The NE of this truncated game is easy to determine when 
$\pi_2/\tau_{2h} \ge p_1\pi_1/(\tau_s + p_1\tau_{1h})$. In this situation, the profitability of prey type 2 is at 
least as high as the nutritional value of only attacking prey type 1 
when there is no recognition time (i.e. $\tau_r = 0$). In section Decision 
trees and the functional response for two prey types (cf. equation 
(1)), the NE behavior of player 1 is then to attack any prey 
encountered and this continues to be the NE strategy when 
recognition time is non-zero. Thus $q_A = 1$ at any NE and, in fact, 
all NE are of the form $(q_A,q_{A2}) = (1,q_{A2})$ for some $0 \le q_{A2} \le 1$ (see 
section The Nash equilibria of the prey recognition game of 
Appendix S1 for the formal derivation). This corresponds to the 
predator being opportunistic (sensu [32]). Note that, if the 
predator immediately attacks an observed prey, the decision 
whether to attack after recognizing the type of prey is no longer 
relevant since this choice is never needed. 



For the remainder of this section, assume that the profitability of 
resource 2 is lower than the mean energy intake rate obtained 
when feeding on the more profitable prey type only (i.e., 
$\pi_2/\tau_{2h} < p_1\pi_1/(\tau_s + p_1\tau_{1h})$). In this case, the predator should consider 
whether to determine the prey type it encountered, because 
including the less profitable prey type in its diet may decrease the 
mean energy intake rate. 

To calculate the NE behavior, we proceed as in section 
Foraging with simultaneous resource encounters. From section 
The Nash equilibria of the prey recognition game of Appendix S1, 
the best response of player 1 to a given strategy $q_{A2}$ of player 2 is 

$$q_A = \begin{cases} 0 & \text{if } q_{A2} < q_{A2}^* \\ 1 & \text{if } q_{A2} > q_{A2}^* \\ [0,1] & \text{if } q_{A2} = q_{A2}^* \end{cases} \tag{13}$$

where 

$$q_{A2}^* = 1 - \frac{(p_1 + p_2)\tau_r(p_1\pi_1 + p_2\pi_2)}{p_2[p_1\pi_1\tau_{2h} - \pi_2(\tau_s + p_1\tau_{1h})]}.$$

Conversely, the best response of player 2 to a given strategy $q_A$ 
of player 1 is 

$$q_{A2} = \begin{cases} 1 & \text{if } q_A < q_A^* \\ 0 & \text{if } q_A^* < q_A < 1 \\ [0,1] & \text{if } q_A = 1 \text{ or } q_A = q_A^* \end{cases} \tag{14}$$

where 

$$q_A^* = 1 - \frac{p_1\pi_1\tau_{2h} - \pi_2(\tau_s + p_1\tau_{1h})}{\pi_2(p_1 + p_2)\tau_r}.$$

That is, $(q_A,q_{A2})$ is a NE foraging behavior if and only if it 
satisfies (13) and (14). We remark that $q_A^*$ and $q_{A2}^*$ are both less 
than 1, and $q_{A2}^* < 0$ when $0 < q_A^* < 1$. 
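
The best-response conditions lend themselves to a direct numerical check. The sketch below is a minimal illustration, assuming the thresholds take the form $q_{A2}^* = 1 - (p_1+p_2)\tau_r(p_1\pi_1+p_2\pi_2)/(p_2 k)$ and $q_A^* = 1 - k/(\pi_2(p_1+p_2)\tau_r)$ with $k = p_1\pi_1\tau_{2h} - \pi_2(\tau_s + p_1\tau_{1h})$, which is the form obtained by differentiating the intake rate (12). The function names `thresholds` and `is_nash` are hypothetical, and the example values are the Figure 7 parameters with an arbitrarily chosen recognition time of 1/10.

```python
from fractions import Fraction as F


def thresholds(tau_s, tau_1h, tau_2h, tau_r, pi_1, pi_2, p_1, p_2):
    """Return (q_A2*, q_A*) for the reduced prey recognition game."""
    # k > 0 exactly when prey 2 is less profitable than feeding on prey 1 only.
    k = p_1 * pi_1 * tau_2h - pi_2 * (tau_s + p_1 * tau_1h)
    q_a2_star = 1 - (p_1 + p_2) * tau_r * (p_1 * pi_1 + p_2 * pi_2) / (p_2 * k)
    q_a_star = 1 - k / (pi_2 * (p_1 + p_2) * tau_r)
    return q_a2_star, q_a_star


def is_nash(q_a, q_a2, q_a2_star, q_a_star):
    """Check the best-response conditions (13) and (14) at a strategy pair."""
    # Condition (13): player 1 responds to q_A2.
    if q_a2 < q_a2_star:
        ok_1 = q_a == 0
    elif q_a2 > q_a2_star:
        ok_1 = q_a == 1
    else:
        ok_1 = True
    # Condition (14): player 2 responds to q_A.
    if q_a == 1 or q_a == q_a_star:
        ok_2 = True
    elif q_a < q_a_star:
        ok_2 = q_a2 == 1
    else:
        ok_2 = q_a2 == 0
    return ok_1 and ok_2


params = dict(tau_s=F(1), tau_1h=F(1), tau_2h=F(2), tau_r=F(1, 10),
              pi_1=F(2), pi_2=F(1), p_1=F(1, 2), p_2=F(1, 2))
q_a2_star, q_a_star = thresholds(**params)
print(q_a2_star, q_a_star)

for pair in [(0, 0), (1, F(1, 2)), (1, F(1, 5)), (0, 1)]:
    print(pair, is_nash(*pair, q_a2_star, q_a_star))
```

For this short recognition time the check reports the isolated equilibrium at $(0,0)$ together with points $(1, q_{A2})$ whose second coordinate lies at or above $q_{A2}^*$, and rejects the other two pairs.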

When recognition time, $\tau_r$, is small ($\tau_r < \tau_{r1}^*$, where 

$$\tau_{r1}^* = \frac{p_2(p_1(\pi_1\tau_{2h} - \pi_2\tau_{1h}) - \pi_2\tau_s)}{(p_1 + p_2)(\pi_1 p_1 + \pi_2 p_2)}$$

), $q_A^*$ is negative and $q_{A2}^*$ is positive 
(Figure 7(a)). There are then two possible NE outcomes; namely, 
attack any encountered prey immediately corresponding to the NE 
$(q_A,q_{A2}) = (1,q_{A2})$ where $q_{A2} \ge q_{A2}^*$ (shown as the gray segment of 
the line in Figure 7(a)) or never attack immediately and then only 
attack prey type 1 when recognized (with NE $(q_A,q_{A2}) = (0,0)$, the 
solid dot in Figure 7(a)). From section Zero-one rule and the Nash 
equilibrium of Appendix S1, the optimal foraging behavior must 
be a NE outcome but, because our decision tree has three levels, 
not every NE need be an optimal strategy. However, even in this 
case finding all NE substantially simplifies the problem of finding 
the optimal strategy, because it is now enough to evaluate the function 
$f$ given by (12) only at these NE points. Moreover, if there is a NE 
component (such as the gray segment in Figure 7(a)) the value of $f$ 
at any point in this segment must be the same. By evaluating 

$$f(0,0) = \frac{p_1\pi_1}{\tau_s + p_1\tau_{1h} + (p_1 + p_2)\tau_r} \quad \text{and} \quad f(1,q_{A2}) = \frac{p_1\pi_1 + p_2\pi_2}{\tau_s + p_1\tau_{1h} + p_2\tau_{2h}}$$

we find $f(0,0) > f(1,q_{A2})$ and so the optimal behavior is to never 
attack immediately and then only attack prey type 1 when 
recognized. As recognition time increases, $q_A^*$ increases and $q_{A2}^*$ 
decreases. However, the NE outcomes and optimal behavior 
remain the same as long as $q_{A2}^* > 0$. Optimal predator behavior is 
then either described as being selective [32] or as being intentional 
[33]. 
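
The comparison of the two candidate outcomes can be reproduced with a few lines of code. This is a sketch under assumptions rather than part of the original analysis: it evaluates the intake rate (12) at $(0,0)$ and at a point with $q_A = 1$ for the Figure 7 parameter values, and the function names and the returned labels ("selective", "opportunistic", "both") are chosen here for illustration.

```python
from fractions import Fraction as F

# Parameter values quoted in the Figure 7 caption (assumed here).
TAU_S, TAU_1H, TAU_2H = F(1), F(1), F(2)
PI_1, PI_2 = F(2), F(1)
P_1, P_2 = F(1, 2), F(1, 2)


def intake_rate(q_a, q_a2, tau_r):
    """Energy intake rate (12) of the reduced prey recognition game."""
    attack_2 = q_a + (1 - q_a) * q_a2   # probability that a prey 2 is attacked
    gain = P_1 * PI_1 + P_2 * PI_2 * attack_2
    time = (TAU_S + P_1 * TAU_1H + P_2 * TAU_2H * attack_2
            + (P_1 + P_2) * tau_r * (1 - q_a))
    return gain / time


def optimal_outcome(tau_r):
    """Compare 'recognize, then attack prey 1 only' with 'attack immediately'."""
    selective = intake_rate(0, 0, tau_r)
    opportunistic = intake_rate(1, 0, tau_r)   # any q_A2 gives the same rate
    if selective > opportunistic:
        return "selective"
    if selective < opportunistic:
        return "opportunistic"
    return "both"


for tau_r in (F(1, 10), F(1, 6), F(1, 2)):
    print(tau_r, intake_rate(0, 0, tau_r), intake_rate(1, 0, tau_r),
          optimal_outcome(tau_r))
```

Only two numbers need to be compared for each recognition time, because the rate is constant along the edge $q_A = 1$.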

For recognition time satisfying $\tau_r = \tau_{r1}^*$, $q_{A2}^* = 0$ while $q_A^*$ 
remains negative and so all strategy pairs of the form 
$(1,q_{A2})$ and $(q_A,0)$ are NE (Figure 7(b)). Moreover, each 
corresponds to an optimal foraging strategy since 

$$f(q_A,0) = \frac{p_1\pi_1 + p_2\pi_2}{\tau_s + p_1\tau_{1h} + p_2\tau_{2h}} = f(1,q_{A2})$$

in this case. 

For still larger recognition times, $q_{A2}^* < 0$, thus $q_A = 1$ at any NE. 
When $\tau_{r1}^* < \tau_r < \tau_{r2}^*$, where 

$$\tau_{r2}^* = \frac{p_1(\pi_1\tau_{2h} - \pi_2\tau_{1h}) - \pi_2\tau_s}{\pi_2(p_1 + p_2)},$$

$q_A^* < 0$ and the 
corresponding optimal foraging behavior is shown in Figure 7(c), 
while for recognition time larger than $\tau_{r2}^*$, $q_A^* > 0$ (Figure 7(d)). In 
both cases, the NE strategy pairs are of the form $(1,q_{A2})$ and these 
all yield optimal foraging behavior. 



Game theory and evolutionary outcomes for the prey 
recognition game 

The existence of suboptimal NE in the prey recognition game 
makes the interesting question considered briefly in section 
Decision trees and extensive form games even more important 
here; namely, how does the predator manage to learn its optimal 
behavior and avoid suboptimal equilibrium behavior. This type of 
question (on the so-called equilibrium selection problem [30]) is 
commonly studied in evolutionary game theory where individual 
behaviors evolve in such a way that strategies with higher payoff 
become used more frequently. There are several standard models 
that examine the evolutionary outcome of these behaviors 
changing over time [29,34]. 

The evolutionary outcome is clear for all choices of parameters 
in the two diet choice models of sections Decision trees and the 
functional response for two prey types and Foraging with 
simultaneous resource encounters (see arrows in Figures 2 and 4, 
respectively). These arrows indicate the direction of increasing 
energy intake rate (e.g. in Figure 4, this rate increases as $q_B$ is used 
more frequently if and only if the vertical arrow is pointing 
upward). In all cases, the predator learns to use the NE strategy 
that corresponds to the optimal behavior for the foraging games of 
Figures 1 and 3 respectively. (This is true for Figure 2b as well 
since the arrows lead to some point on the vertical side of the unit 
square with $q_A = 1$, all of which correspond to optimal behavior in 
this threshold case when $p_A = p_A^*$.) 

The evolutionary outcome is also clear from Figure 7 for the prey 
recognition game of this section when recognition time is large. 
Specifically, for $\tau_r \ge \tau_{r1}^*$ (panels (b), (c) and (d)), the predator will 
evolve to a strategy on an edge consisting of NE points which 
correspond to optimal foraging. In the language of evolutionary 
game theory, this set of NE forms a globally stable set that attracts 
any initial predator behavioral choice as long as behaviors evolve 
in the direction of increasing energy intake rate. 

However, for short prey recognition time (i.e. $\tau_r < \tau_{r1}^*$ with 
$0 < q_{A2}^* < 1$ in Figure 7(a)), the NE $(0,0)$ (corresponding to the 
optimal foraging behavior of always spending the time to 
recognize the type of prey encountered and then only attacking 
prey of type 1) may not be globally stable. If the predator initially 
attacks recognized prey 2 with probability greater than $q_{A2}^*$, 
behavior may evolve to a point in the suboptimal NE component 
where $(q_A,q_{A2}) = (1,q_{A2})$ with $q_{A2} \ge q_{A2}^*$ (i.e. to a point on the gray 
line segment in Figure 7(a)). That is, the predator may become 
trapped at this suboptimal behavior, especially if evolution 
increases the strategy of attacking immediately faster than it 
decreases the strategy of attacking recognized prey 2 (i.e. if the 
arrow to the right in the top half of Figure 7(a) is much bigger than 
the one pointing down). 
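
The dependence on the initial behavior can be illustrated with a simple learning rule. The sketch below uses projected gradient ascent on the intake rate (12) as a stand-in for "behaviors evolve in the direction of increasing energy intake rate"; this rule, the step size, the number of iterations and the finite-difference gradient are all assumptions made for illustration and are not the evolutionary dynamics of [29,34]. Parameter values are those of Figure 7 with an assumed recognition time of 0.1, for which $q_{A2}^* = 0.4$.

```python
# Figure 7 parameter values with an assumed short recognition time.
TAU_S, TAU_1H, TAU_2H, TAU_R = 1.0, 1.0, 2.0, 0.1
PI_1, PI_2 = 2.0, 1.0
P_1, P_2 = 0.5, 0.5


def intake_rate(q_a, q_a2):
    """Energy intake rate (12) of the reduced prey recognition game."""
    attack_2 = q_a + (1.0 - q_a) * q_a2
    gain = P_1 * PI_1 + P_2 * PI_2 * attack_2
    time = (TAU_S + P_1 * TAU_1H + P_2 * TAU_2H * attack_2
            + (P_1 + P_2) * TAU_R * (1.0 - q_a))
    return gain / time


def clip(x):
    """Keep a probability inside the unit interval."""
    return min(1.0, max(0.0, x))


def learn(q_a, q_a2, step=0.5, iterations=2000, h=1e-6):
    """Projected gradient ascent on the intake rate (illustrative dynamics)."""
    for _ in range(iterations):
        # Central finite differences approximate the two partial derivatives.
        d_qa = (intake_rate(q_a + h, q_a2) - intake_rate(q_a - h, q_a2)) / (2 * h)
        d_qa2 = (intake_rate(q_a, q_a2 + h) - intake_rate(q_a, q_a2 - h)) / (2 * h)
        q_a, q_a2 = clip(q_a + step * d_qa), clip(q_a2 + step * d_qa2)
    return q_a, q_a2


print(learn(0.1, 0.1))   # starts below q_A2*: ends at the optimal NE
print(learn(0.9, 0.9))   # starts above q_A2*: ends on the edge q_A = 1
```

Under this particular rule the second trajectory stops on the edge $q_A = 1$ because the intake rate no longer depends on $q_{A2}$ there; the rule contains no drift or experimentation, so it cannot leave the suboptimal component.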

The situation depicted in Figure 7(a) is remarkably similar to 
that of the two-player extensive form Chain store game [19,35] 
(also known as the Ultimatum mini-game [36] or the Entry 
deterrence game [37]). In the large literature on this game, it is 
often argued that the evolutionary outcome will be the point $(0,0)$ 
since neutral drift near the suboptimal NE component will 
inevitably lead at some time to the strategy choice shifting to 
$q_{A2} < q_{A2}^*$ after which selection will quickly lead to $(0,0)$. To see 
this, consider a point on this gray line segment. If the predator 
decides once in a while to spend some time to recognize the type of 
prey it encounters, its strategy will move to the left of the segment. 
As strategies with higher payoff are then to the right and down in 
the vicinity of the line segment, it is likely that $q_{A2}$ will regularly 
decrease until it reaches the lower end of the segment. Any further 
strategy experimentation on the part of the predator will lead to 
$q_{A2} < q_{A2}^*$, after which the only evolutionary outcome can be $(0,0)$. 
In terms of evolutionary game theory again, the suboptimal NE 
component is not stable whereas the optimal NE is. 

In summary, the optimal foraging behavior is selected in the 
prey recognition game as the NE component that is the stable 
outcome of the evolutionary learning process whether or not prey 
recognition time is short (i.e. for arbitrary $\tau_r$). The analysis of 
optimal foraging theory for this example illustrates anew the 
potential of game-theoretic methods to gain a better understand- 
ing of issues that arise in behavioral ecology. 

Discussion 

In this article, we develop a game-theoretic approach for 
constructing functional responses in multi-prey environments and 
for finding optimal foraging strategies based on these functional 
responses [9,20]. The approach here is based on methods from 
extensive form games [18,19]. The importance of these game- 
theoretic approaches for functional response is two-fold. First, 
decision trees similar to those used in extensive form games are a 
natural way to describe details of predator behavior based on the 
sequence of choices the predator makes at different decision 
points. This facilitates writing down the corresponding functional 
response. Second, we show that optimal foraging behavior that 
maximizes energy intake per unit time can be determined by 
solving the underlying foraging game for its Nash equilibrium. We 
documented these game theory methods through three examples: 
the classical diet choice model, simultaneous encounter with prey, 
and a model in which recognition time is considered. We remark 
that, although the calculation of the optimal foraging behavior in 
the first example is straightforward, it is not as easy in the last two 
cases where our game theory methods lead readily to the solution. 

Decision trees are often used in evolutionary ecology to describe 
possible decision sequences of individuals in biological systems 
[28], including models of kleptoparasitism [38] and of producers 
and scroungers [39]. They have been used less often in connection 
with functional responses, even though the predation process can 
be conveniently described by such trees (e.g., [26,40,41]). Optimal 
foraging behavior that maximizes animal fitness is then often 
described as a sequence of single choices at each decision node 
faced by the predator. Such outcomes are reminiscent of those 
found by applying the backward induction technique to extensive 
form games that also chooses one strategy at each decision node 
[18,19]. However, there are essential differences. Specifically, 
under backward induction, the optimal choice at such a node 
depends only on the comparisons of payoffs along paths following 
this node. Unfortunately, the time constraint in our foraging game 
means that decisions at one node have payoff consequences as to 
what is optimal at another node, a connection between these 
decision nodes of the tree that has no counterpart in extensive 
form games. That is, the payoff concept for "foraging games" such 
as Figure 1 combines both the nutritional values and the duration 
of each activity given at all the end nodes of the decision tree. On 
the other hand, as shown in all three examples, the extensive form 
technique connected to backward induction of forming the 
reduced decision tree by truncating those paths corresponding to 
dominated strategies remains an effective means of considerably 
simplifying the NE analysis. 

Dynamic programming (a form of backward induction) has also 
been used to find optimal foraging behavior [23,42]. Specifically, 
the approach developed by Houston and McNamara [23] shows 
that the optimal foraging strategy must maximize the difference 
between the expected energy intake during a single renewal cycle 
and the product of the mean optimal energy intake rate and the 
duration of the cycle. This approach specifies the optimal choice at 
each decision node provided the energy intake rate under the 
optimal strategy is known. Since the optimal choice in one part of 
the decision tree then requires knowing the overall optimal 
strategy, the solution is typically obtained by numerical iteration. 
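
The kind of numerical iteration referred to here can be sketched as follows. This is a generic Dinkelbach-style scheme written for illustration and is not claimed to be the procedure of [23]: it guesses a rate, picks the strategy maximizing expected gain minus rate times expected cycle duration, and updates the rate until it stops changing. The sketch is applied to the reduced prey recognition game with the Figure 7 parameter values, an assumed recognition time of 1/10, and a search restricted to pure strategies; all function names are hypothetical.

```python
from fractions import Fraction as F
from itertools import product

# Figure 7 parameter values with an assumed recognition time of 1/10.
TAU_S, TAU_1H, TAU_2H, TAU_R = F(1), F(1), F(2), F(1, 10)
PI_1, PI_2 = F(2), F(1)
P_1, P_2 = F(1, 2), F(1, 2)


def gain_and_time(q_a, q_a2):
    """Expected energy gain and expected duration of one renewal cycle."""
    attack_2 = q_a + (1 - q_a) * q_a2
    gain = P_1 * PI_1 + P_2 * PI_2 * attack_2
    time = (TAU_S + P_1 * TAU_1H + P_2 * TAU_2H * attack_2
            + (P_1 + P_2) * TAU_R * (1 - q_a))
    return gain, time


def surplus(strategy, rate):
    """Gain minus the energy that the guessed rate would yield in the same time."""
    gain, time = gain_and_time(*strategy)
    return gain - rate * time


def iterate_rate(max_rounds=50):
    """Alternate between choosing a strategy and updating the rate guess."""
    strategies = list(product((0, 1), repeat=2))   # pure (q_A, q_A2) pairs
    rate = F(0)                                    # initial guess
    best = strategies[0]
    for _ in range(max_rounds):
        best = max(strategies, key=lambda s: surplus(s, rate))
        gain, time = gain_and_time(*best)
        new_rate = gain / time
        if new_rate == rate:                       # fixed point reached
            break
        rate = new_rate
    return best, rate


print(iterate_rate())
```

At the fixed point the maximal surplus is zero, which is the renewal-cycle characterization described in the paragraph above.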

Instead, the approach we take in this article avoids such 
numerical methods by solving the game analytically. In this game, 
virtual players (also called agents) are associated with each decision 
point. These players are virtual because their payoff is derived 
from the functional response of a single individual only. 
Nevertheless, these players play a game because their decisions 
are linked: one player's optimal strategy depends on the other 
players' decisions. We showed that solving this game by finding all 
the Nash equilibria will lead to the optimal foraging strategy. In 
those cases where some NE are not optimal foraging strategies, we 
showed it is easy to select the optimal ones among them by 
calculating their mean energy intake rate. Even when the game 
has infinitely many Nash equilibria that form a segment of a line 
(such Nash equilibrium components often arise in extensive form 
games), we showed that the energy intake rate at all these Nash 
equilibria will be the same. This means that once there are a finite 
number of isolated Nash equilibria points or Nash equilibrium 
components, finding the optimal strategy corresponds to compar- 
ing a finite number of values, which is trivial. 

We documented these game-theoretic methods by applying 
them to three examples. The classic diet choice model with two 
prey types where predators encounter prey sequentially was 
considered first since it has been historically analyzed without 
game theory and yet provides an informative introduction to our 
new approach. Then we moved to a more complicated situation 
where a searching predator can simultaneously encounter both 
prey types [31,43]. These authors showed that under simultaneous 
encounter the predictions based on the prey profitabilities (i.e., 
energy content over handling time) are not sufficient to predict the 
optimal foraging strategy. In fact, the optimal foraging strategies 
can be quite complicated as they depend now also on the relation 
between the energy content in different food types. In particular, 
Figure 5A shows that when the less profitable prey type 2 contains 
also less energy than the more profitable prey type 1, then the 
more profitable prey type 1 will be selected when both prey types 
are encountered. However, when the energy content of the less 
profitable prey type is large (but still small enough that prey type 2 
continues to be less profitable), it will be preferred when both prey 
types are encountered (Figure 5B). The solid line in Figure 5B 
shows the preference for prey type A when encountered with prey 
type B. When this preference switches from 1 to 0 above $\pi_B = 3$, 
predator preference for prey type B when encountered with prey 
type A switches from 0 to 1. All possible optimal foraging strategies 
as a function of the alternative prey type energy content are shown 
in Figure 4. In particular, it cannot happen that the less profitable 
prey type is included in the predator's diet when encountered 
simultaneously with the more profitable prey type but not taken 
when encountered alone. 

The last model discussed in this article examines whether a 
predator should spend time to recognize which type of prey it 
encountered before deciding whether to attack the prey or not 
[32]. This example is more complex for several reasons, including 
the fact that the corresponding decision tree now has three 
different levels (whereas the previous two examples are described 
by two-level trees). While the NE corresponds exactly to the 
optimal strategy in two-level decision trees, this is not the case 
here. When recognition time is small, we show that there are NE 
of the optimal foraging game that lead to suboptimal foraging 
(Figure 7(a)). However, these suboptimal NE are easy to exclude 
by equilibrium selection techniques borrowed from evolutionary 
game theory [30]. Specifically, optimal foraging is always given by 
the unique NE outcome that corresponds to the stable equilibrium 
point (or set of equilibrium points) as the predator learns its 
optimal strategy (i.e. as its strategy evolves in the direction of the 
arrows in Figures 2, 4, 7). 

This method taken from evolutionary game theory to determine 
optimal foraging behavior differs from the more traditional 
approach based on the (modified) zero-one rule. This latter 
approach can be applied to the prey recognition game. Kotler and 
Mitchell [32] show that the zero-one rule yields just two possible 
optimal outcomes: either completely opportunistic or completely 
selective behavior. Instead of analyzing the effects of increasing 
recognition time as we have done, they concentrate on what 
happens when the abundance of the less profitable prey increases 
(which, in our notation, means $p_2$ increases). They emphasize the 
somewhat counterintuitive result that, with low abundance, the 
less profitable prey is excluded from the diet. At intermediate 
abundances it is included, and then with high abundance it is 
excluded again. 

Game-theoretic methods play an important role in the 
traditional approach as well. Specifically, because the energy 
intake rate is the same at all points of the NE component, we need 
to compare only two numbers; the energy intake rate at any point 
of the NE component (the gray line segment in Figure 7(a)) and the 
energy intake rate at the other NE point (0,0). Our analysis shows 
that, when recognition time is small, the optimal foraging strategy 
is to always pay the extra time to recognize the encountered prey 
type (i.e., never attack the encountered prey item immediately, 
$q_A = 0$, Figure 7(a)) and to include it in the diet only if it is the more 
profitable prey type (i.e., not to include the alternative prey type 2, 
$q_{A2} = 0$). As the recognition time increases, the optimal foraging 
strategy is not to waste time recognizing the encountered prey type 
(Figure 7(c), (d)). In this case, all encountered prey types are included 
in the predator's diet and so $q_{A2}$ is not uniquely defined. That is, since 
all encountered prey are immediately included in the predator's diet, 
the question whether to include the recognized prey type in the 
diet becomes irrelevant and so the preference for the alternative 
prey type is any number between 0 and 1. 

For the three optimal foraging games modeled in this paper, the 